Rank | Count | Beginning |
---|---|---|
2587 | 913 | Die |
1661 | 368 | Das |
2164 | 368 | Der |
7612 | 314 | Sie |
9439 | 276 | Wir |
5839 | 218 | In |
4506 | 217 | Es |
4903 | 144 | Für |
5689 | 143 | Im |
3288 | 142 | Diese |
4028 | 139 | Ein |
6667 | 122 | Mit |
4060 | 112 | Eine |
780 | 106 | Auf |
1033 | 105 | Bei |
5349 | 102 | Hier |
5524 | 99 | Ich |
8541 | 96 | Unsere |
687 | 93 | Auch |
9204 | 89 | Wenn |
8433 | 83 | Und |
6852 | 77 | Nach |
7958 | 77 | So |
3256 | 75 | Dies |
479 | 73 | Am |
8528 | 71 | Unser |
8354 | 69 | Um |
4399 | 60 | Er |
7510 | 60 | Seit |
318 | 58 | Alle |
In the next four subsections show the most frequent sentence beginnings consisting of N words, N=1, 2, 3, 4. In this subsection we start with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
Especially for N=1, we only need a small corpus to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV